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ODOMETERS AND TOEPLITZ SYSTEMS REVISITED IN THE 
CONTEXT OF SARNAK’S CONJECTURE 

TOMASZ DOWNAROWICZ AND STANISLAW K AS JAN 


Abstract. Although Sarnak’s conjecture holds for compact group rotations 
(irrational rotations, odometers), it is not even known whether it holds for all 
Jewett-Krieger models of such rotations. In this paper we show that it does, 
as long as the model is at the same a topological extension. In particular, we 
reestablish (after |AKL| ) that regular Toeplitz systems satisfy Sarnak’s con¬ 
jecture, and, as another consequence, so do all generalized Sturmian subshifts 
(not only the classical Sturmian subshift). We also give an example of an 
irregular Toeplitz subshift which fits our criterion. We give an example of a 
model of an odometer which is not even Toeplitz (it is weakly mixing), hence 
does not fit our criterion. However, for this example, we manage to produce 
a separate proof of Sarnak’s conjecture. Next, we provide a class of Toeplitz 
sequences which fail Sarnak’s conjecture (in a weak sense); all these examples 
have positive entropy. Finally, we examine the example of a Toeplitz sequence 
from |AKL| (which fails Sarnak’s conjecture in the strong sense) and prove 
that it has positive entropy, as well (this proof has been announced in [AKL] j. 
This paper can be considered a sequel to IAKLI , it also fills some gaps of M- 


1. Introduction 

This note results from the discussions the authors held with Mariusz Lemanczyk 
about topological (in particular symbolic) models of odometers in the context of 
Sarnak’s conjecture. We refer the readers to the recent paper of Lemanczyk et al 
lAKL] for results concerning Sarnak’s conjecture for Morse systems, where some 
indispensable facts concerning regular Toeplitz subshifts were obtained partly in¬ 
dependently and partly jointly, and inspired this work. 

First of all, it has been discovered that although regular Toeplitz subshifts are the 
best known symbolic models (the precise meaning of a “model” will be given in the 
next section) of odometers, there are also other possibilities, the existence of which 
was not fully realized before. In the preceding work of the first author m there are 
some erroneous statements about regularity, and this note fixes them; there exist 
irregular Toeplitz models of their underlying odometers. For completeness, we also 
give examples of models which are not even Toeplitz (in fact topologically mixing). 
Sarnak’s conjecture can be shortly proved for both regular and irregular Toeplitz 
models, as long as they are topological extensions of the modeled odometers. The 
method uses the sole property of the Mobius function, that it is orthogonal to any 
periodic sequence, otherwise it relies on an easy spectral argument. As a digres¬ 
sion, we apply a similar spectral method (but a different property of the Mobius 
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function) to prove Sarnak’s conjecture for isomorphic extensions of other equicon- 
tinuous systems, in particular for generalized Sturmian subshifts. Our method fails 
for other (e.g. topologically weakly mixing) models of equicontinuous systems and 
the case remains a challenge. Nonetheless, we are able to successfully apply it to 
our topologically mixing example. By this occasion we also give relatively simple 
examples of Toeplitz subshifts which massively fail Sarnak’s conjecture (at many 
points including Toeplitz sequences), showing that just being a union of periodic 
sequences is insufficient. We also copy from |AKLj an example in which a Toeplitz 
sequence fails the conjecture so bad that the limit inferior of the absolute values of 
the averages is positive. As one might expect, the counterexamples have positive 
entropy, of which we give detailed proofs (which are skipped in |AKL | ). 

2. Preliminaries on Toeplitz systems 

The notation and terminology of this section is consistent with that of [D], where 
we also refer for references to earlier papers. To make this paper selfcontained, the 
most crucial definitions will be repeated. By a scale we mean an increasing sequence 
of positive integers H = {pk)k> 1 such that Pk\Pk+i for every k. The adding machine 
with scale H is the topological group G obtained as the inverse limit of the cyclic 
groups Z Pk = Z/pfcZ: 

G = HmZ 

k 

By an odometer we will mean the topological dynamical system (G, r) where r 
is the homeomorphism g H > g + 1 of G into itself, where 1 = (1,1,...) is the 
topological generator of G. The odometer is minimal, equicontinuous and zero¬ 
dimensional, and the conjunction of these three properties characterizes odometers 
among topological dynamical systems. Odometers are uniquely ergodic, with the 
Haar measure A being the unique invariant measure. For us, an odometer also 
means the ergodic system (G, A, t). By a (Jewett-Krieger) model of an ergodic 
system we will understand any strictly ergodic (minimal and uniquely ergodic) 
topological dynamical system isomorphic (for its unique invariant measure^) to 
the given ergodic system. Note that there may (and usually do) exist mutually not 
conjugate models of the same ergodic system. For instance, an odometer is a model 
for itself, but there are other models as well, for example some symbolic systems 
over finite alphabets (which are never equicontinuous, so they cannot be conjugate 
to the odometer). 

We will be mostly concerned with Toeplitz system^ understood slightly more 
generally than usually, i.e., we will not require that they are symbolic, nonetheless, 
all our examples will be symbolic. Toeplitz systems can be defined in a multitude of 
ways, which is captured in the theorem below. The relevant definitions and proofs 
can be found in m (for the reader’s convenience, the definitions are also given in 
footnotes). 

Theorem 2.1. The following conditions are equivalent for a topological dynamical 
system (X,T). A system satisfying them is called a Toeplitz system. 


^Convention: isomorphism is measure-theoretic while conjugacy is topological. 

2 In the literature they are often called “Toeplitz flows”; we find this notation confusing, as 
they are discrete time systems. 
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(1) (X, T) is the orbit closure of a regularly recurrent pomj^J - 

(2) (X, T) is a minimal almost 1-1 extensioi^of an odometer (G, r); 

(3) (X, T) is a semicocycle extensioi |^] of an odometer (G,t). 

The conditions (1) and (2) are additionally related: 

Theorem 2.2. If 7r is the almost 1-1 factor map from a Toeplitz system ( X, T) to 
an odometer (G, r) then x £ X is regularly recurrent if and only i/7r -1 (7r(x)) = {x}. 

In Theorem 2.1[ the odometer (G, r) appearing in (2) is the maximal equicon- 
tinuous factor r of (X, T). The adding machine G appearing in (3) (with the action 
of t) is the maximal equicontinuous factor of (X , T) if and only if the semicocy- 
cle has the additional property of being invariant under no rotatzon^J Otherwise, 
the maximal equicontinuous factor of (X, T) acts on a quotient group of C0 and 
(G, t) is not even a factor of (X, T) (let alone maximal equicontinuous). We can 
strengthen condition (3) as follows: 

Theorem 2.3. Every Toeplitz system is conjugate to a semicocycle extension of 
its maximal equicontinuous factor. 

An important class of Toeplitz systems, called regrz/a?|^J is described by the con¬ 
ditions given below: 

Theorem 2.4. Let tt be the almost 1-1 factor map from a Toeplitz system (X, T) 
to an odometer (G, r). The following conditions are equivalent: 

(1) The set of points g £ G such that ffir ~ 1 * 3 * * * 7 (g) = 1 has full measure A; 

(2) The set of regularly recurrent points in (X, T) has full measure for every 
invariant measure on X; 

(3) The set of discontinuities Df of the relevant semicocycle f on G has A 
measure zero. 

The following is obvious by (1): 

Theorem 2.5. A regular Toeplitz system (X, T) is strictly ergodic and it is isomor¬ 
phic to its maximal equicontinuous factor (G,r), and the isomorphism is provided 
by the almost 1-1 topological factor map n. 

In particular, a regular Toeplitz system is a model for the underlying odometer. 
Unless 7T is 1-1 everywhere, (X, T) is not conjugate to (G, t). 


3 A point x is regularly recurrent if, for every open U =) x. the set of return times to U contains 
an arithmetic progression nZ. 

'[’lie factor map 7r : X —> G is almost 1-1 if tt~ ] (n(x)) = {x} holds on a residual subset of X. 

3 A semicocycle is a function / : G -x K into a compact space, which is continuous on a 

residual subset of G. Let F denote the multifunction defined by the closure of the graph of /. Let 
Xp = {x S /G : (3g £ G) (Vn £ Z) x(n) £ F(g + n) }. This set is closed and shift-invariant, and 
has a unique minimal subset which we denote by Xf. Xj with the action of the shift is called the 

semicocycle extension (associated with /). 

®Any other equicontinuous factor of (X,T) factors through (G,r). 

7 A semicocycle / on an adding machine G is invariant under no rotations if F o (■ + g) = 

F 9 = 0. 

H 'fliis quotient group is G/H , where H = [g : F o (■ + g) = F\. 

®The coincidence of this term with “regular recurrence” is incidental: the latter term was 
coined independently from those who invented “regular Toeplitz systems”. 
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In ID] Theorem 13.1 (5) and (6) it is claimed that a strictly ergodic Toeplitz sys¬ 
tem isomorphic to its maximal equicontinuous factor is necessarily regular. More¬ 
over, it is claimed that strict ergodicity need not be assumed if an isomorphism 
exists for some invariant measure. Unfortunately, the statement (even with strict 
ergodicity assumed) is false. Relevant counterexample is provided in the following 
sections. 

Let us return to the general case. In symbolic systems regularly recurrent points 
are called Toeplitz sequence^] Toeplitz subshifts (and Toeplitz sequences) were 
the first examples of almost 1-1 extensions of odometers and they are the most 
important. In this class we have an additional simplification: 

Theorem 2.6. Every Toeplitz subshift over a finite alphabet A is conjugate to a 
semicocycle extension of its maximal equicontinuous factor with the semicocycle 
taking values in A. 

The advantage of having the finite-valued semicocycle is that its set of discon¬ 
tinuities Df C G is then closed. The elements of Toeplitz subshifts have specific 
structure, as described below. 

Definition 2.7. Let (X, T) be a Toeplitz subshift- and let ( G,t ) be the maximal 
equicontinuous factor of (X, T). For x £ X and p £ N we denote 

Per p (x) = {n £ Z : (Vm £ Z) x(n) = x(n + mp)}, Aper(x) = Z \ Per p (x). 

peN 

and call these sets the p-periodic part and aperiodic part, respectively. 

The union of periodic parts will not change if we unite over a scale {pk)keN of 
the adding machine G (then the union is increasing). It is important to know that 
Per Pfc (x) and Aper(x) depend only on 7r(x). Clearly, x is a Toeplitz sequence if and 
only if its aperiodic part is empty. 

Recall that for a set A C Z the forward and backward densities of A are de¬ 
fined as dens + (A) = lim„ j#(An [0, n— 1]), dens'(A) = lim n L#(An [—n, —1]), 
respectively (provided the limits exist) and in case they coincide we call them the 
density of A and denote by dens(A). 

Theorem 2.8. Let (A', T) be a Toeplitz subshift. Then, for every p £ N, dens(Per p (x)) 
exists and is constant throughout X. Let d = 1 — sup fc dens(Per p) .(x)). Then 

(1) d = \{D f ); 

(2) dens(Aper(x)) < d for every x £ X; 

(3) dens(Aper(x)) = d for p-almost every x £ X, for every invariant measure 
p on X. 

In particular, regularity of (A, T) is equivalent to d = 0, and to dens(Aper(x)) = 0 
at every point. 

Let (A, T) be an irregular Toeplitz subshift. Consider the set of such points 
x £ X that dens(Aper(x)) = d (by (3) above, this set has full invariant measure). 
Since Aper(x) is constant throughout every fiber of 7r, our set equals 7r —1 (£7) for 
some E C G. For x £ tt~ 1 (E) we enumerate Aper(x) = {n,}i e z assuming that the 
sequence {nf) is increasing and letting no be the smallest nonnegative element of 

1 ( 'Regular recurrence takes on the form (Vn S Z)(3p S N)(Vm £ Z) x(n) = x(n + mp). 
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the sequence. We let y x = (:c(ni)), g z £ A z and call it the aperiodic readout of x. 
For g £ E we let Y g = {y x : x £ tt ~ 1 (g)j- 

Definition 2.9. We say that the Toeplitz subshift (X, T) satisfies the condition 
SAR ( same aperiodic readouts,) ifY g is the same for every g £ E. We then denote 
the common space Y g by Y. 

It is easy to see that in this case Y is closed and shift invariant. The following 
theorem plays the crucial technical role in most of our examples (for proofs see 03): 

Theorem 2.10. Let (X, T) be an irregular Toeplitz subshift satisfying the condition 
SAR. Let Tf be the skew product acting on G x Y given by 

T f(g,y) = {T(g),s 9 (y)), 

where S 9 equals the shift or the identity; depending on whether g £ Df or not, 
respectively. Then 

(1) There is a bijection between invariant measures of (X,T) and invariant 
measures of (G x Y,Tf); 

(2) Every invariant measure on (X, T) is isomorphic to its corresponding in¬ 
variant measure on the skew product; 

(3) Every invariant measure on the skew product has marginals A on G and 
some shift-invariant measure v on Y; 

(4) Every shift-invariant measure is on Y appears as the marginal for at least 
one invariant measure on the skew product (for example for A x is); 

(5) The entropy of the skew product with respect to an invariant measure equals 
d times the entropy of the corresponding marginal on Y. 

(6) The topological entropy of the skew product (which equals the topological 
entropy of (X,T)) equals d times the topological entropy of the shift on Y. 

We will refer the the above facts several times. 

3. Preliminaries on the Mobius function and Sarnak’s conjecture 
The Mobius function denoted by fi is defined on positive integers as follows 

! 1 if n = 1, 

0 if n has a repeated prime factor, 

(—l) r if n is a product of r distinct primes. 

This function has been introduced by A. F. Mdbius in [Mj to obtain inversion for¬ 
mulas for arithmetic functions [CDMj . It plays an important role in number theory. 
The reader is referred to the rich literature in that area for more information, let 
us quote only two fundamental monographs: e, m- 

Definition 3.1. Let f{n) and rj(n) be two bounded complex-valued sequences over 
N. We say that these sequences are uncorrelated if 

n 

hm-V'((i)»;(i) = 0. 

n n ' 

Z— 1 

One of the intriguing properties of the Mobius function is its apparent random¬ 
ness in the distribution of its values. It is well known that the forward density of 
square-free numbers (i.e., of the set {n : \/J.(n)\ = 1}) exists and equals \ (see [N], 
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Thm 21.8 and the following Corollary). On the other hand, the densities of posi¬ 
tive and negative values are equal implying that p is uncorrelated to the constant 
sequence (see [P], Thm 5.1). 

Moreover, it is uncorrelated to any periodic function (an elementary proof can 
be found in ELI), a fact which is connected with the laws of the distribution of 
primes along arithmetic progressions (see [Sal ). Let us remark that a more detailed 
analysis of this phenomenon, more precisely, of the behavior of the partial sums of 
the Mobius function, is an important area of study, connected to many fundamental 
number theoretical problems, see e.g. classical works [Wl . ISchl and more recent 
papers |RR| . m- 

Sarnak ISa) conjectures that p is uncorrelated to any sequence obtained by read¬ 
ing any continuous function along any orbit in any topological dynamical system 
with topological entropy zero, as follows: 

Conjecture 3.2. Let (X,T) be a topological dynamical system with topological 
entropy zero. Let f : X —> C be a continuous function. Fix an x £ X and let 
f(n) = f(T n x) (for n > 1). Then £ and p are uncorrelated. 

The conjecture is known to hold for relatively few types of dynamical systems, 
in particular for odometers, irrational rotations, nilsequences |GT] . horocycle flows 
[BHZ] See also [Bl], [B], fOT) . [G], [KL], (LS ], [MR] , for other results. We remark, 
that validity of Sarnak’s conjecture for odometers follows directly from the fact that 
the Mobius function is uncorrelated to any periodic sequence. The validity for irra¬ 
tional rotations can be proved by a criterion from jBSZj or by completely different 
property of the Mobius function, discovered by Davenport [Da] . It is a folklore fact 
that the conjecture holds for the classical Sturmian subshift; the proof uses heavily 
the fact that this subshift has complexitjj^] c(n) = n +1 ( |MHj ). Recently, in AKLj 
Sarnak’s conjecture has been proved for some Morse subshifts. Every continuous 
function on such a system decomposes as the sum of a function depending on the 
Toeplitz factor and an orthogonal one. Thus the method relies on two ingredients: 
for continuous functions orthogonal to the Toeplitz factor some specific spectral 
and disjointness methods are used. To handle the other ingredient the authors 
simply prove the conjecture for regular Toeplitz subshifts. In this note we extend 
the latter proof (in fact, we only notice that essentially the same proof applies) to 
a class slightly larger than regular Toeplitz systems, that of isomorphic extensions 
of compact monothetic group rotations. This includes some not necessarily regular 
Toeplitz systems and generalized Sturmian subshifts. 

4. Sarnak’s conjecture for isomorphic extensions 

The following fact has been observed jointly by the first author and M. Lemanczyk. 

Theorem 4.1. Let (X, T) and (Y, S) be strictly ergodic topological dynamical sys¬ 
tems, with invariant measures p and v, respectively, and let i t : X Y be a 
topological factor map which is, at the same time, an isomorphism. If Sarnak's 
conjecture holds for (Y, S ) then it also holds for (X,T). 

Proof. Let / : A' —» C be continuous. Then f £ L 2 (p) and /' = / o7r -1 £ L 2 (v) (/' 
is defined u-almost everywhere on Y). Since C(Y) is dense in L 2 (y), there exists a 
continuous g' : Y —» C such that f \ f — g'\ 2 dv < e 2 , hence f \ f — g'\ du < e. The 

1 1 t he number of words of length n in the subshift 
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function g = g' o tx is continuous on X and J \f — g\ dg < e. Because in strictly 
ergodic systems every point is genericJ^J we have lim„ £ \f( Tix )~9(T ix )\ < e, 

for any x € X. Now, we write 


n 

n 


< 


1 

n ' 


n 

-Y,\f( Tix )-9(T l x)Mi)l 

n 


The first average on the right hand side equals | ^ Y^i=i 9 '(S * l y)n(i )\, where y = 
7r(x), and is small for large n, because Sarnak’s conjecture holds on (' Y,S ). The 
last average does not exceed, for large n, the arbitrarily small e. Thus the left hand 
side tends zero with growing n. □ 


In lAKLI the reader will find a slightly different statement, in which (Yj S) is 
assumed coalescen^] and the assumption that the isomorphism between (X, T ) 
and (Y. S) is realized by the same topological factor map 7r is dropped (it is then 
fulfilled automatically). Recall that odometers and other ergodic group rotations 
are coalescent. 

We now draw conclusions concerning particular types of topological dynamical 
systems. It seems that items (lb) and (2) below are new. Notice that for the 
classical Sturmian subshift we have obtained a new proof not relying on the exact 
complexity. 


Corollary 4.2. Sarnak’s conjecture holds for: 


( 1 ) 

( 2 ) 


regular semicocycle extensions of any minimal equicontinuous systemin 
particular 

(la) regular Toeplitz systems (see also |AKLj ): 

(lb) generalized Sturmian subshift^\ 

some irregular Toeplitz subshifts as in the Example 5.1 below. 


Proof. To be absolutely clear, let us argue why does the conjecture hold for minimal 
equicontinuous systems. By the Halmos-von Neumann Theorem, every such system 
is uniquely ergodic and the space L 2 (yj) is spanned by (at most countably many) 
continuous eigenfunctions. Thus every continuous function can be approximated 
in L 2 (p) (hence also in L 1 (p)) by a finite sum of continuous eigenfunctions. Now, 
by an argument as in the preceding proof, it suffices to verify the conjecture for 
continuous eigenfunctions. But every such function arises as a continuous function 
defined on either an odometer (if the eigenvalue is rational) or an irrational rotation 
(otherwise). □ 


1 fulfills the ergodic theorem for every continuous function 

1 A very endomorphism from the system to itself is an isomorphism 

1 ^Notice that semicocycle extensions can be as well defined on any strictly ergodic system, not 
necessarily on an odometer. Regularity means that the set of discontinuities of the semicocycle 
has measure zero. 

1 ,r> A classical Sturmian subshift is obtained as the semicocycle extension of the irrational ro¬ 
tation by an angle a, where the semicocycle is precisely the characteristic function of [0,aj. In 

generalized Sturmian subshifts the semicocycle is admitted characteristic function of any nonde¬ 
generate subinterval or even a finite union of intervals. 
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5. Examples of models of odometers 

Example 5.1. There exist irregular Toeplitz subshifts which are strictly ergodic and 
isomorphic (via the same topological factor map) to their maximal equicontinuous 
factor odometers (for these Sarnak’s conjecture holds). 

Sketch of the construction. An explicit example of such a system is generated by 
the (unilateral) Toeplitz sequence described below. 

Pick a block B\ = 000...01000...0 of some length tt consisting of zeros but one 
symbol 1 (somewhere). We place this block yq-periodically (for some = Pi > tt, 
we also let qi = pi) (see Figure 1). 

0100 *** 0100 *** 0100 *** 0100 *** 0100 *** 0100 *** 0100 *** 0100 *** 0100 *** 0100 ... 

FIGURE 1. On this figure B \ = 0100, tt = 4 and p \ = 7. 

The unfilled places (the stars) come in blocks of length Q\= q\—r\. We pick a 
block £?2 of some length r 2 Qi, consisting of zeros but one symbol 1 . We write this 
block into r 2 consecutive empty blocks and repeat ^-periodically, where p 2 = <? 2 Pi; 
for some g 2 > r 2 (see Figure 2). 


0100000 0100100 0100000 0100***0100***0100***0100000 0100100 0100000 0100... 


FIGURE 2. On this figure B2 = 000100000, r 2 = 3 and g 2 = 6. 


Now the unfilled positions come in clusters of g 2 — r 2 blocks of length qi — r±. 


We continue in this manner: in step k + 1 we use a block Bk+i consisting of all 
zeros but one symbol 1, whose length equals r/.+i (a freely chosen number) times 
Qk = (pi — rr)(p 2 — f 2 ) • ■ ■ (Pk — fk) (the number of unfilled positions in [0 ,pk — 1] 
in the so far constructed sequence), we use this block to fill all unfilled places in 
[0, ru+iPk — 1], then we repeat it with a period Pk+i = qkPk for some freely chosen 
qk > fk- Two more details must be taken care of: the products IIfcLi(l — ^ 7 ), 
representing the density of unfilled positions after step N, must converge to a 
number d > 0. The second requirement is that for each k the symbol 1 appears in 
the future blocks Bk' ( k 1 > k) at positions whose remainders modulo Qk assume 
every possible value infinitely many times. With such an arrangement it is not very 
hard to see that: 

(1) the generated (bilateral) Toeplitz subshift (X, T) is an almost 1-1 extension 
of the odometer (G,t) with scale H = ( pk ); 

(2) (X, T) is irregular and satisfies the condition SAR with the space Y of 
aperiodic readouts consisting of all { 0 , l}-valued sequences having at most 
one symbol 1 . 


Clearly, Y supports only one invariant measure v , the pointmass at the fixpoint 
(...000...). Theorem 2.10 (l)-(3) implies that (X,T) is uniquely ergodic, the 
unique invariant measure p is isomorphic to the only measure with marginals A 
and v, which is A x v. Since v is supported by one point, the factor map 7 r provides 
an isomorphism between p and A, as required. ■ 
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There exist strictly ergodic systems (also subshifts) isomorphic to an odometer 
(G, r), yet w hose maximal equicontinuous factor (G 7 , r 7 ) is a proper factor of (G, r) 
and Theorem 4.1 does not apply to such systems. For G which is not simplify such 
examples are easily obtained with G 7 being an adding machine and the system is an 
almost 1-1 extension of (G 7 ,r 7 ), while the remaining eigenvalues of G are realized 
by discontinuous eigenfunctions (see EE3). There exist also models for which the 
maximal equicontinuous factor is trivial (hence all eigenvalues of (G, t) are realized 
by discontinuous eigenfunctions and the system is topologically weakly mixing). 
Such examples can be produced for all odometers, including the simple ones. Below 
we give an example with even stronger property of topological mixing f*^| 


Example 5.2. Given an odometer (G, r), there exists a strictly ergodic topologically 
mixing subshift (X,T) isomorphic to (G, r). 


Sketch of the construction. Since we will be dealing with subshifts, T will always 
denote the shift transformation, regardless of the domain. Let H = ( pk ) denote 
the scale of the odometer. Let ( X 0 ,T ) be a regular Toeplitz subshift with max¬ 
imal equicontinuous factor (G, t) , and let Xq £ Xq be a Toeplitz sequence. We 
will produce a sequence of topological conjugacies of (X 0 ,T), converging almost 
everywhere to an isomorphism with the desired subshift ( X , T). 

In step 1, choose some rq £ N and find all periodic repetitions in Xq of the central 
block Ci = x 0 [— rq,rq]. The period of the repetitions is some pi £ H. Choose q\ 
such that piqi £ H and choose every <71 th periodic occurrence of C\ in xq (avoiding 
the central one). Let us call these places 1-windows. Now comes the modification: 
within each 1 -window we shift the contents one position to the left (sending the 
leftmost symbol to the right end). The modification passes over, in an obvious way, 
to all elements of X 0 and is invertible if q\ is large enouglf^J This is our conjugacy 
: X 0 —» Xi between two Toeplitz subshifts. We denote by X\ the Toeplitz 
sequence $i(:ro). 

In step 2 we choose some r 2 and we denote G 2 = X\[— r 2 ,r'2]• We must take care 
of two details: C2 must be long enough to include several 1 -windows, moreover, its 
ends must fall far from the 1 -windows, for instance, approximately in the middle 
between two of them. In the future this will prevent an accumulation of the end- 
irregularities. We find all periodic occurrences of G 2 in Xi, and their period p-2 £ H. 
Next we choose some large <72 such that P 292 £ H and we mark every ( 72 th copy of 
G 2 in X2 (avoiding the central one) as 2-windows. Like before, we shift the contents 
of each 2 -window one position to the left, sending the leftmost symbol to the right 
end. This modification spreads naturally to a conjugacy <L >2 : X\ —» X 2 between 
Toeplitz subshifts. We let X 2 = $ 2 (aq) = $ 2 $i(®o)- 

We proceed in this manner infinitely many times, assuring that the densities of 
the positions affected by consecutive modifications (i.e., the ratios pk = ) are 

summable, and that the ends of the fc-windows fall, for every k' < k approximately 
in the middle between the a pair fc'-windows (see Figure 3). 


1,1 An odometer is simple when its scale is (p k ) for a prime number p. Simple odometers have 
no infinite proper factors, other do. 

17 It is known (E) that every aperiodic ergodic system has a topologically mixing strictly 
ergodic model. Here we provide a particular example. 

18 qi must be long enough so that the pi-periodic part of any x £ Xo can be determined by 
viewing any block of length p\ qi — 2r± — 1. 





10 


TOMASZ DOWNAROWICZ AND STANISLAW KASJAN 


FIGURE 3. The colors show the fc-windows (i.e., areas affected by 
original - black, 1-windows - red, 2-windows - green, 3-windows - blue 
(the next 3-window too far to be shown). The shift by one position of 
red within green, and that of green and red within blue are to small to 
be seen. 


Let Xao denote the sequence obtained as the limit of Xk (which exists because 
every position in xq is affected by the modifications at most finitely many times), 
and we let X be the shift orbit closure of Xoo and finally X, a minimal subset 
of Notice that the maps ’Ffc = ... $ 2 $! converge at every point of Xq 

whose every coordinate is affected at most finitely many times. We let 'll be the 
(partially defined) limit map on Xq. 

The following observations are standard and we skip their proofs: 

(1) (Xoo, T) is topologically mixing (can be checked directly by the definition). 

(2) T is defined almost everywhere on X'o and acts into X^. 

(3) For each k, on X we can define the following map ipk- in each element of 
X we can easily identify and reverse the consequences of the modification 
'I>i, $ 2 , * 1 > 3 ,..., (although we “invert” <!>/. in a seemingly wrong order, 
these inverses commute). Attention: the maps ipk are not precisely inverses 
of ’Ffc; they are defined on X^, not on Xk- 

(4) The limit of the above maps ipk (call it ip) is defined almost everywhere on 
Xoo for every invariant measure. 

(5) The range of ip is contained in X 0 and ip inverts 4/ wherever the latter is 
defined. 

The last four facts imply that (X^ ,T) is uniquely ergodic and isomorphic to 
(Xo,T). Since the minimal subset X C X^ supports an invariant measure, it 
supports the unique one, hence (X, T) is isomorphic to ( Xq,T ) and, additionally, 
minimal. Thus (X, T) is the desired example. ■ 

Although we have no general proof of Sarnak’s conjecture for all models of 
odometers, interestingly, it does hold in the above topologically mixing example. 
Again, we will only sketch the argument. 

Sketch of proof of Sarnak’s conjecture in Example \5.£\ Since every continuous 
function on X can be approximated in L 1 (/x) by a (continuous) linear combination 
of characteristic functions of cylinders corresponding to finite blocks, it suffices to 
verify the conjecture for such characteristic functions. We will do that for blocks of 
length 1, i.e., for the occurrences of the symbol 1. The argument for longer blocks 
is identical. 

Fix an e > 0 and let k be such that the density of places where x^ differs from 
x k (i.e., EV =k+\Pk') is less than e. Since Xk is regular Toeplitz, there are l and 
N such that the p;-periodic part occupies a fraction at least 1 — e in every block 
of length N appearing in any element of X*,. The problem we are facing is that 
the (slightly perturbed) pj-periodic part in x £ X does not come from a continuous 
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function on X any more, and we have no guarantee that all points realize the 
ergodic theorem for it. We need more subtle observations. 

We can assume that N is large enough so that 


»=1 


< ne 


for every {0, l}-valued pj-periodic sequence £ and every n > N. Let x £ X and 
consider the initial block x[l, n] for n > ^ (this is an arbitrary block B of length n 
appearing in x^). We will argue that there are (at most) three blocks B x ,B 2 , B :i 
appearing in Xk, with lengths summing to n, such that x[l,n] nearly equals the 
concatenation B\B 2 B 3 in the sense that the fraction of disagreements is at most 
2e. Once this is proved, we can write B\ = x'[l,m], B 2 = x"[m+ 1, to'], B :i = 
x'"[m! + l,n] for some x',x",x"' € Xk and m,m! € [l,n], m < m!, and then have 
the following estimation 


n 

i =1 


2 ne 


m 


m 


i= 1 i=m -\-1 


a;,,, (*)A i (*) 

i=m '-\-1 


m 


m 


m 


n 


m 


x, (* w*) + ^ x "\ i )^{ i ) 

i= 1 i =1 i—1 i— 1 i— 1 


< 10ne, 


because 

• the sums shorter than N contribute at most sN < sne (s < 4 is the number 
of such sums), 

• in each sum not shorter than N 

— x',x" or x'" can be replaced by a pz-periodic sequence and this will 
change the sum by less than e times the summing length, 

— once the above replacement is done, the absolute value of the sum does 
not exceed e times the summing length, 

• the sum of the summing lengths equals (5 — s)n. 

Clearly, this estimation ends the proof. 

So, it remains to break a block B of length n appearing in into at most three 
subblocks, as desired. We can think of B as of a block Bq appearing somewhere 
in Xk, affected by some finitely many modifications < J > z c+ i,..., If B is entirely 
contained in a /C-window without its endpoints, then 4>x shifts the entire contents 
one position to the left, so that the result is the same as if we started from the block 
Bq lying in Xk one position to the left with respect to B 0 and pretended that 
did not affect it. In this manner, we can ignore all such cases and move on to the 
largest index (and denote this one by I \) for which only part of B is affected by 
that is, B contains an endpoint of a Iv-window. Now there are three possibilities: 

(1) B intersects two or more K- windows, 

(2) B contains one or both endpoints of just one A'-window. 

In case (1) the fraction of modifications introduced by in B is at most 2 pk (and 
Pk' for earlier modifications with k' = k + 1,..., K — 1), so B differs from Bq on a 
fraction of at most 2e places and there is no need to partition it (we put B 1 = Bq, 
there is no B 2 or B 3 ). In case (2) we cut B at the endpoints of the A^-window. 
This produces two or three subblocks B^^B^Bq. Notice that affects only one 
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of these subblocks and only by shifting it “in one piece”. So, we only need to see 
how much each subblock is affected by the earlier modifications. Recall that for 
each k' < K the endpoints of the fc'-windows fall approximately Pk '^ k ' places away 
from the endpoints of the di-window. This implies that if a modification does 
affect a subblock B' i (i = 1,2 or 3), then the fraction of the modifications in this 
subblock is at most 2 ■ So each B[ differs from a subblock of Xk on a fraction of 
at most 2e places. This completes the argument. □ 


6. Toeplitz sequences which fail Sarnak’s conjecture 

We will say that sequences £ and 77 are weakly (resp. strongly) correlated if the 
upper (resp. lower) limit of ^ Xu=i is positive. 

6.1. Weak failure. 


Example 6.1. There exists an irregular {—1,1 }-valued Toeplitz subshift X such 
that some x £ X are strongly correlated with fi. The set of such points x is dense 
in X. Moreover, weak correlation holds on a residual subset of X. 


The example is very simple, once the general construction of Toeplitz systems is 
understood: Consider an irregular Toeplitz subshift satisfying the condition SAR 
and such that Y, (the space of aperiodic readouts) is the full shift on two symbols 
{— 1 , 1 } (we skip the detailed construction of such a subshift; it is done by a “stan¬ 
dard method” e.g., the Oxtoby technique, see [D]). Let 7 r : A' —► G denote the 
maximal equicontinuous factor map onto the underlying adding machine and let A 
be the Haar measure on G. Then, by Theorem |2.8| (and the explanations following 
that theorem), there is a set G' C G with A(G') = 1 which satisfy the following two 
conditions: 

( 1 ) all points x in the fiber 7 r —1 (< 7 ) agree along a common periodic part whose 
density equals 1 — d, 

( 2 ) as x ranges over 7 r —1 (^), all possible {— 1 , l}-valued sequences occur along 
the aperiodic part of x. 

In particular, if we arrange that d > 1- % (~ 0.7) (which is easily done within 

the “standard method”) then, for every g £ G' and x £ TT~ 1 (g) the set A = 
Aper(x) fl {n > 0 : £t(n) 7 ^ 0} has positive lower forward density at least do = 
d + -3 — 1 > - 3 . There exists a point x 0 £ 7 r~ 1 (g) such that for n £ A it equals 
/x(n). It is obvious that (even in the “worst case scenario”, when x(i) = — fi(i) and 
|/x(z) | = 1 everywhere on the periodic part of x) we still have 


( 6 . 1 ) 


1 x a 

liminf — 7 Xo(i)^(i) > do — (1 — d) > 0. 

n n z —' 

i=l 


Next, we will show that points Xq as constructed above (satisfying (6.1)) lie 
densely in X. Consider a basic open set U in X, i.e., a cylinder corresponding to 
a block B £ {—1, l } 2fe+1 occurring in X at the coordinates [— k, k\. By minimality, 
the same block occurs (perhaps at a different place) in the generating Toeplitz 
sequence, which implies that the same block occurs somewhere in the periodic part 
of every element of X , in particular in an element x £ ■n~ 1 {G'). Notice (directly 
from the definition) that the set G' is invariant (equivalently, t:~ 1 {G') is shift- 
invariant). Thus, by an appropriate shifting, we obtain a new point x such that B 
occurs in x at the coordinates [— k, k] and still belongs to the periodic part of x , 
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and g = 7r _1 (:r) belongs to G'. Using this particular g in the above construction of 
Xq we produce the point lo such that Xq [—k, k] = B (xq belongs to the same fiber 
as x and thus agrees with x along the periodic part, which includes the coordinates 


[— k, fc]). In other words, we have constructed a point Xq £ U satisfying (6.1). 

Next we observe that if we weaken (6.1) by requiring that the upper limit is 


larger than or equal to a positive e < do ~ (1 — d), then it holds on a residual set. 
Indeed, we can write 


1 n 1 

: limsup — x(i)/j,(i) > e > C 

n n . 

i=i ) 

f| U (* : ^ £*(*)/*(*) >ej c 

f 1 , n ^ 1 

< x : lim sup — x(i)/j,(i) > e > . 


ra> 1 n>m 


The first set contains the dense set of points satisfying (6.1), the middle set is of 
type Gs (hence it is a dense Gs) and thus the last set is residual. The example is 
completed. ■ 


Since Toeplitz sequences form a residual subset inside a Toeplitz subshift, we 
conclude that 

Corollary 6.2. There exist (irregular) Toeplitz sequences weakly correlated with 
the Mobius function. 


We remark, that the Toeplitz subshift of the above example has positive entropy 
(equal to din2), hence it stands in no collision with the Sarnak’s conjecture. 


6.2. Strong failure. The following example is replicated from [AKLI . 

Example 6.3. There exists a one-sided Toeplitz sequence strongly correlated with 
the Mobius function. 


We begin by describing a general scheme (used in |AKL| ) of producing a one-sided 
Toeplitz sequence from another symbolic sequence. Let y = (y n ) n >i be a one¬ 
sided sequence over a finite alphabet A. Let H = (jpf.) be a scale of an adding 
machine such that p\ > 3. Since Pk+i > 2 pk for every fc, this condition implies 
that p = J2 k >i TH < 1- deHne the associated one-sided Toeplitz sequence x as 
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follows: 

Vi! n = 1 mod pi 

y 2 , n = 2 mod p 2 


y P c 

n = pi mod p Pl 

Ypi+iJ 

n = pi + 2 mod p Pl +\ 

y P i+2 ’ 

n = p\ + 3 mod p Pl+2 

y2 P i-i> 

n = 2pi mod p 2pi -1 

Y2pi 5 

n = 2pi + 3 mod p 2pi 


We refrain from further detailed listing, as it becomes too complicated. The simple 
rule behind the scheme is that y k is placed at the first position available after steps 
1, 2, — 1 and then it is repeated periodically with the period p k . This concludes 

the description of the scheme. 

For further considerations, it will be convenient to highlight, for each k, the first 
placement of y k in x y (it is shown in boldface, while its further periodic repetitions 
are printed in the normal font). While reading the following text and diagrams it 
is important to distinguish between boldface symbols y k and normal font symbols 
y k . The diagram below shows the filling scheme in case p k = 3 k with the boldface 
terms marked. 


yiy2y3yiy4y5yiyeyTyiy2y8yiy9yioyiyiiyi2yiy2yi3yiyi4yiByiyi6yiTyiy 2 y3yiyiayi9yiy2o... 


The authors of jAKLj show that the lower density of the boldface symbols (which 
they call initials) is at least 1 — p (combining this with Lemma 7.1 (3) below we 
see that in fact these symbols have density 1 — p), which can be made arbitrarily 
close to 1. They select y so that x y (n) = p(n) whenever x y (n) is a boldface 

symbol. If 1 — p > 1-% then, for the same reasons as in (6.1) (with 1 — p in 

the role of d) 1 they obtain that x is strongly correlated with p. Of course, in view 
of Sarnak’s conjecture, one is obliged to compute the topological entropy of the 
generated Toeplitz subshift (at least to check whether it is positive). We will do so 
in Sectional ■ 


7. Properties of the scheme 

We are interested in properties of Toeplitz sequences obtained through the above 
scheme for general sequences y. In particular, we would like to know whether 
positive entropy follows automatically from positive entropy of the orbit closure Y 
of y. As we soon show, the answer is negative. This is quite unfortunate, because 
it forces us to estimate the entropy of the example of 1 AKL] using tedious methods 
adapted the the particular example. 

So, consider a general sequence y and the associated Toeplitz sequence x y with 
the boldface symbols marked. The following lemma addresses the distribution of 
the boldface symbols in x y . The statement (5) will be used immediately in Example 
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7.3 while statement (4) only in Section [8] Statements (1) and (2) are necessary to 


prove (4), while (3) is just a digression noted in passing. 


Lemma 7.1. Let z be the {0, 1 }-valued sequence given by the rule z{n) = 1 •<=>■ 
x y (n) is a boldface symbol. Then 

(1) The frequency of zeros in the block B ki o = z[l,p k ] converges to p from 
below, as k — >■ oo. 

(2) For every k every block of the form B kJ = B kJ [1 ,p k ] = z[jp k +1, (j + l)p fc ] 
(n > 0) can be obtained from B k $ by only replacing some 1 ’s by 0 ’s. 

(3) The symbols 0 in z have lower Banach density p. 

(4) Given e > 0 there is a S > 0 and no £ N, such that, for any n > no, the 
cardinality of different blocks B of length n, appearing in z and in which 
the frequency of 0 ’s is at most p + S, does not exceed 2 ne . 

(5) For every natural m, z contains a block consisting of m single symbols 1 
separated by blocks of zeros of lengths at least m. In particular, the upper 
Banach density of zeros in z is 1. 


Remark 7.2. (3) and (5) imply that the subshift generated by z is not uniquely 
ergodic; at least one invariant measure assigns to the cylinder of 0 the value p and 
at least one - the value 1 (perhaps there are more possibilities). (4) implies that 
every measure of the first kind has entropy zero. We have not verified whether this 
subshift has topological entropy zero (regardless of the scale H = (p k )). 

Proof. For (1) it suffices to observe that the frequency of zeros in B kfi equals 

y ( rn. _ 1 ) = (y 

Pk jL^ K p k ’ J \Z-^ Pk J P>= 

k' <k k' <k 

Indeed, in step k’ < k , in x y [l,p k \ we have placed symbols, of which one was 
boldface. Hence the formula. 

For (2) note that B kt o{i) = 0 if and only if x y (i) is a normal font (i.e., repeated) 
symbol y k , for some k'. This is possible only when p k i < p k . But then p k is a 
multiple of p k ' which implies that B k j(i) = 0 for every j. 

Clearly, (1) implies that the lower density (and lower Banach density) of zeros is 
at most p. By (2), the frequency of 0’s in any B k j may only be larger than that in 
B k o (which is close to p). Every sufficiently long block B in z is a concatenation 
of the blocks B k j and negligibly small prefix and suffix, so the frequency of 0’s in 
B is not less than p minus a negligibly small error term. This proves (3). 

For (4) we argue as above: every block B of large length n, after removing 
negligibly small prefix and suffix, becomes a concatenation of the blocks B k j (with a 
large parameter k). By (2), this concatenation can be viewed as a periodic repetition 
of B k: o with some l’s replaced by 0’s. But the number of replaced symbols 1 cannot 
essentially exceed nS, otherwise the overall frequency of 0’s would be too large. Such 
a replacement can be performed in approximately e rt (-‘ 51n<5 -( 1 -' s ) in(i-5)) different 
ways, which, for an appropriately small 5, is smaller than 2 rae . 

For (5), we will need the following (somewhat lengthy) 

Claim. For every m > 1, after some number k m of steps of filling x y (i.e., after 
having placed the periodic repetitions of y 1 ,... ,y k ), x y starts with the following 
configuration (later referred to as C): a continuous entirely filled block (with both 
boldface and normal font symbols) ending with the boldface y km followed by a 
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single unfilled coordinate, next a continuous block filled with normal font terms 
followed by a single unfilled position, next again a continuous block filled with 
normal font terms followed by a single unfilled position, and so on. The continuous 
filled blocks (including the first one) have strictly decreasing lengths and there is 
in of them (see the diagram below for to = 6, most of the indices are omitted). We 
do not require that the last unfilled position is single (it may be followed by more 
unfilled positions). 

yiyyyyyyy fcm nyyyyyynyyyyyDyyyyDyyynyyn 

Proof of the Claim. For m = 1 the condition is fulfilled after k\ = 1 steps, so 
the induction starts. Suppose the claim holds for some m > 1. The pattern C 
is repeated periodically with the period pk m and the repetitions cannot overlap 
(because the lengths of the filled blocks are all different). This implies that the 
pattern C is contained in x y [l,Pk m \- Let us move to the first repetition of the 
pattern C further to the right (call it C'). It starts at the position p km +\ and clearly, 
here all symbols are printed in normal font. Notice that at least two preceding 
positions: p km and p km — 1 are not occupied (because p km = 0 mod pk and p km — 
1 = pk — 1 mod pk for any k < k m , while the positions filled with the symbol y k 
have values mod Pk positive and much smaller than pk ). Now we perform the 
construction steps k m + 1, k m + 2, etc., of filling in x y , until we fill the position 
Pk m ~ 1 (with a boldface symbol y k , for some k' > k m ). Notice that the pattern 
C' reaches to at most the position 2 pk m , which is smaller than Pk m +i + 1, so the 
repeated (normal font) symbols added in these new steps fall to the right of C' 
(i.e., they do not affect it). In this manner we fill all the unfilled positions within 
x y [l,pk m — 1] creating (together with C') a pattern as required for m + 1 in the 
induction (with k m+ \ = k'). Note that the new initial continuously filled block 
x y [l,pk m — 1] has length p km — 1 larger than or equal to the length of the pattern 
C' (perhaps without counting its last empty cell), in particular, for to > 1, it is 
strictly longer than the first completely filled block of C . For m = 1 this also 
holds, because, in this case, the length of the first (and unique) filled block of C' is 
1, while pi — 1 > 1. The claim is thus proved. 

In the following construction steps, the unfilled positions in the pattern C are 
filled with the boldface symbols y km+ \,y km+ 2 ,... ,y km + m - Because the lengths of 
the separating normal font blocks strictly decrease, and there is m — 1 of them, 
the first one has length at least to — 1, the next one to — 2 and so on. Thus, 
taking for simplicity m to be even, we obtain that each of the boldface terms 
yfc TO+ i,yfc TO+ 2 ,... ,yfc m +m is followed (and preceded) by a block of normal font 
symbols of length at least . Renaming as to ends the proof. □ 

It follows from the construction (or we can easily arrange it by choosing a sub¬ 
sequence) that fc m +i > k m + to for each to > 1. 

Example 7.3. There exists a one-sided symbolic sequence y such that its orbit 
closure Y has positive topological entropy and supports many invariant measures, 
yet the associated Toeplitz subshift X y (the orbit closure of x y ) is strictly ergodic 
with entropy zero. 

We just need to decide about the contents of the sequence y. Let A = {0,1}. For 
each to > 1 we let y[k m + 1, k m + to] be a block A m and y[k m + to + 1, k m + i] 
be the block consisting entirely of zeros. We arrange that the sequence of blocks 
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(A m ) m >i generates a positive entropy subshift Yq with many invariant measures 
(for example, the full shift on two symbols). It is clear that the orbit-closure Y of 
y contains Yq, hence has positive entropy and many invariant measures. 


Let us ignore “accidental” periodic repetitions of symbols in x y . This is to say, 
we will denote by Per k (x y ) the set of positions of the jv-periodic repetitions of the 
symbols y fe , for k! < k. The density of so defined Per k (x y ) is J2k'=i 

It follows from the general facts concerning Toeplitz subshifts, that if p is an 
invariant measure on X y , then p-almost every x € X y has the “non-accidental” 
periodic part of density p < 1, and the remaining part (which we denote by Aper(x), 
although at the moment we only know it contains the true aperiodic part). Clearly, 
Aper(cc) is infinite as it has density 1 — p. 

Suppose Aper(x) contains two positions n and n + m such that x{n) = x{n + 
to) = 1. This implies that in x y there are infinitely many positions n' such that 
x(n') = x(n' + to) = 1 and n' and n' + m both belong to arbitrarily high periodic 
parts. This is to say, x y (n') = y k , (or y k >) and x y (n' + to) = y k ,, (or yk"), where 
k',k" are arbitrarily large, for instance larger than both k and k m and such that 
Pk',Pk" > to. Suppose k! < k" (the other case is symmetric). Shift (if necessary) 
the window [n', n! + to] to the left by a multiple of p k > so it starts with the boldface 
symbol y k /. Say, this is now [n", n" + to] . The position n" + to cannot be occupied 
by y k /// with k < k 1 because then n! + m would also be occupied by the same y k m 
(while it is by y k „ or y k n with k" > k'). This implies that n" + to is occupied by 
some y k m or y k >" with k!" > k'. But in such case, since m is smaller than p k ">, it 
must be the first occurrence, i.e., y k "’. Since y k i = 1, k' must belong to an interval 
[k m ', k m ' + itt-'] for some m! and since k’ > k rn , m! must be larger than or equal to 
to. This implies that the occurrence of y k ’ in x y (it occurs as x y {n")) is followed 
by a block of at least to normal font symbols. This is a contradiction since we have 
just shown that x y {n" + to) is a boldface symbol. 


We have proved that if x has an infinite aperiodic part, this part is filled with 
zeros except perhaps one 1. This immediately implies that X y has entropy zero 
and is strictly ergodic (in fact, it is isomorphic to the odometer, like the system of 
Example 5.1). 


Although the above statement already captures the most important properties 
of the Toeplitz subshift X y , we have not yet guaranteed that 

(1) the sequence x y is irregular with the density of a “true” aperiodic part 
equal to 1 — p, 

(2) the odometer (G, r) is a factor of X y . 

All these features must be arranged separately, by delicate modifications of y : yet, 
which do not destroy what we have already achieved. We will only outline what 
needs to be done, skipping the tedious and not very interesting details. 

We must realize that the construction steps of filling in x y corresponds to suc¬ 
cessively defining the associated semicocycle as constant on some clopen subsets 
C k C G. The positions of these subsets are determined by the scheme; they form a 
dense subset of G and have jointly the Haar measure p. 

For (1) we need to assure that the cocycle is discontinuous at every point of the 
complementary set D C G. This can be done by making sure that we assign at 
least two different values in every neighborhood of every point of D , which can be 
achieved by modifying (if necessary) the values of y along a very sparse subsequence, 
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so sparse that it would not affect other properties. Notice that every neighborhood 
of every point in D contains infinitely many sets C so we can choose an arbitrarily 
sparse subsequence of these sets which visits all such neighborhoods. 

Likewise, for (2) we need the semicocycle to be invariant under no rotation. For 
this is suffices that we arrange a discontinuity point “unlike any other”. This can 
also be done by very sparse modifications of y. ■ 

8. Entropy of the Example 16.31 

We have eliminated the possibility of an “automatic” proof that the entropy in 
Example |6.3| is positive just based on the fact that /r generates a subshift with 
positive topological entropy. On the other hand, in view of Sarnak’s conjecture, we 
are obliged to check positivity of its topological entropy. 

Proof of positivity of the entropy of Example \ 6.S\ For short, we will call the squares 
of prime numbers the p-squares. They will be denoted by d\, d- 2 , d%, 

Let B denote an arbitrary block appearing in the subshift generated by \fi\. Let 
n denote the length of B , which we assume is large. Our goal is to indicate a place 
(an interval of n consecutive coordinates) where B occurs in \fi\ and in the Toeplitz 
sequence x y the number of boldface symbols is close to n(l — p) (i.e., nearly realizes 
the upper Banach density of such symbols). Recall that p is the sum of the inverses 
of the periods pk and is smaller than \. First we will argue that finding such places 
(for all long enough blocks B) suffices for positivity of the topological entropy of 

-V.„ 

Indeed, let C denote the block appearing in x y over this interval. Then B 
can be reconstructed knowing C and two additional data: the positions of all the 
normal font symbols in x y in the considered interval and the contents of | /x | at these 
positions. Since the number of normal font symbols is not larger than n(p + 5), 
Lemma |7.1| implies that there are at most 2 ne possibilities as to how the normal 
font symbols are distributed, and then there are at most 2 n ^ p+s ' > possibilities as to 
their contents in \pi\. This produces the estimate 

#{B} < #{C} ■ 2 ne ■ 2 n{p+s) , 

where ff{B} and #{C} denote the cardinalities of blocks of length n in |/r| and 
in x yi respectively. Since \fi\ generates a subshift of entropy ^ log2, i.e., #{B} is 
nearly 2 6 ”/ 71 ' 2 , the cardinality #{C} is (ignoring the small terms) nearly 2 T d 6 / 7r “~ p ), 
which yields positive topological entropy of X y whenever p < (while we have 
assumed it is smaller even than half of that number). 

So, we focus on finding an interval, as specified at the start of the proof. Find 
an interval / of length n where B occurs in \fi\ (there is such). Positions of zeros 
in I can be divided in two classes: first class - coordinates divisible by any of the 
p-squares d\,...,d,L, where is the largest p-square smaller than n, and second 
class - the remaining ones (which are divisible by larger p-squares). Note that for 
each p-square larger than or equal to n only one of its multiples can occur in I. 
Now by the Chinese Reminder Theorem (see e.g. m ,Clrap. I) we can shift the 
interval I (and call the shifted interval I'), so that 

(1) the shift is by a multiple of d\d 2 ■ ■ ■ dx, where dx is the largest p-square 
smaller than An 2 , and 
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( 2 ) zeros of the second class become (after shifting) divisible by some a pri¬ 
ori selected large p-squares ei, ...,e q (the choice of these p-squares will be 
specified in a moment). 

The zeros of the first class appearing in /x over the interval I' are precisely the 
shifted zeros of the first class over I. All zeros of the second class occurring over 
I correspond (via the shift) to zeros of the second class occurring over I', but the 
later interval can have more zeros of the second class (some new zeros divisible by 
p-squares larger than 4n 2 and different from ei,e 2 , ■ ■ ■ ,e q can occur here). The 
configuration of zeros inherited from / is repeated in /x in every interval I" along 
an arithmetic progression starting with /' and advancing with step 


M = d± ■ ■ ■ cLk ■ e\ ■ ■ ■ e q , 

(this need not be the smallest step, just one which is sure). From now on we 
will observe only the intervals I" appearing along this progression. In every such 
interval the additional zeros (if there are any) must be divisible by p-squares larger 
than or equal to 4n 2 . Using Lemmas 8.2 and 8.3 provided at the end of the paper, 
one easily obtains, that the percentage (among the observed intervals) of intervals 
where there are any additional zeros does not exceed 


n 


‘-IK 1 


j>K 



< 


Vd- 


K+l 



In other words, in at least around 1/2 of the observed intervals in |/x| there occurs 
precisely the block B. 


It now suffices to arrange that majority (a bit over 1/2 is enough) of these 
intervals are such that in the Toeplitz sequence x y there are nearly n{l — p) boldface 
symbols. 

Recall that H = (pk) is the scale used to construct x y , and for each k > 1, 
Pk+i is an essential multiple of pk (at least times 2). We are going to mark three 
“important points” Ai, A 2 , ^3 on the axis of the parameter k (these points bear 
hidden dependence on n, not visible in the denotation). 

k -[: Let be the largest k such that pk < n. Notice that pk > 2 fc implies 
jfei < Inn/In 2 , which (for large enough n) is smaller than [n/lnn]. 


Next “important points” require auxiliary functions. Let Nk be the number of 
prime factors (in the meaning sum of their multiplicities) of the largest common 
divisor of pk and d\ ■ ■ -dx- The function k 1 —> Nk is nondecreasing and becomes 
constant before it reaches 2 K + 1. This implies it eventually lies below the line kj 2. 

k 2 : Let k 2 be the largest k (if such exists), for which Nk > k/2. Notice that 
Nk < 2 K implies k 2 < 4A", which (for n large) is small compared to n (equal 
to four times the number of primes smaller than 2 n, i.e., approximately 
8n/ln2n). At this point we agree that if k 2 < 8n/ln2n (or does not exist 
at all), then we put &2 = [8n/ln2n]. In particular, this guarantees that 
k 2 > fci. 

k 3 : Let &3 be some place, not smaller than k 2 , where the function Nk has 
already reached its maximum. At this moment we define the numbers 
ei,..., e q , to be relatively prime with Pk 3 + 4 q - We can do it now, because the 
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particular values of e\ ,..., e q have not been used in defining the preceding 
points. 

Now consider a similar auxiliary function Mk defined analogously as Nk with M 
in place of d-\ ■ ■ ■ (Ik- Note that to the right of + 4 q the function Nk does not 
grow, while Mk may increase by at most 2q (possibly even in one jump), however, 
thank to the specific choice of ei,..., e q , this function will never again cross the line 
k/2. 

The “important points” are shown on the figure below 



FIGURE 4. The function Nk is shown in red, and Mk in green. 


The positions of normal font symbols in x y (equivalently, of zeros in z) we divide 
into arithmetic progressions with periods pk (along such a progression x y (i) = y fc ) 
and we classify these progressions in three groups. 

( 1 ) k = l,...,fci 

( 2 ) k = ki + 1,..., fc 2 

(3) k > k 2 

(The number ks is needed only to define ei,..., e q and is not used to separate the 
groups.) We will now analyze the progressions according to this classification. 

1. Each progression from the first group occupies in every interval of length n 
approximately n/pk positions (at most 2 n/pk) for the largest k in the group). The 
union of these progressions occupies not more than n(p + e) (for large enough n). 

2. Every progression from the second group is represented in every interval of length 
n by at most one coordinate. Jointly these groups occupy at most 4 1\ positions, 
which does not essentially exceed 8n/ln2n, i.e., negligibly little compared to n. 


So far we have been estimating the number of normal font symbols in an interval 
of length n , and so far it came out close to np (i.e., as we need it). From now on we 
will estimate the percentage of the “observed” intervals I" which are disjoint from 
the progressions belonging to the third class. 

3. Consider a k in the third group. The period pk has with M no more than 
k/2 common prime divisors. However, pk has at least k prime factors, eac h eq ual 
to at least 2. This implies that pk/gcd(j>k, M) > 2 k / 2 . Applying Lemma 8.1 we 
conclude that the percentage of intervals /" intersecting the progressions from the 
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third group does not exceed nJ2k>k 2 2 _fc ^ 2 - Since &2 > 8nlog2n, this estimate is 
arbitrarily small for large n. 

To summarize, we can arrange that more than half of the observed intervals I" 
do not intersect any of the progressions from the third and fourth groups. In such 
intervals |/i| reads B. Combining this with a previous estimate we obtain that there 
exist intervals in which both \fi\ reads B and in x y there are nearly n(l — p) boldface 
symbols, as required. This completes the proof. □ 

Now the missing lemmas. 

Lemma 8.1. Consider a collection of arithmetic progressions of natural numbers 
Aj = {kpj + rj : k £ N} ; where rj > 0 and pj/j —► oo as j oo. Let M be a 
natural number and r a nonnegative integer. The upper density of the set 

{k £ N : kM + r £ [J Aj} 

3> 1 


is less than or equal to J2j>i jr, where p} = gcd ^f j M) for j > 1. 

Proof. Given n let J n denote the maximal number j such that nM + r > pj. Note 
that for any n and j: 

Tl 

life € N : k < n, kM + r £ A,}| < — + 1 

Pj 

and the set on the left hand side is empty if j > J n . It follows that 

L\{k £ N : k < n, kM + r G U J > 1 Aj}\ = L\{k £ N : k < n, kM + r £ |J^" 1 Aj}\ < 
n |{fc G N : k < n, kM P r £ Aj} \< l fi:) + Jn) 

Thanks to our assumption on pj, ^ —> 0 as n —j oo, thus the assertion follows. □ 

Lemma 8.2. In the family of all subsets of N (or of Z) which have well defined 
density, density can be viewed as a finitely additive probability measure. Then 
any finite collection of periodic sets with relatively prime periods is stochastically 
independent. 

Proof. Since every periodic set with period p decomposes as a disjoint union of 
finitely many arithmetic progressions with step p, it suffices to prove the lemma for 
arithmetic progressions (notice that the density of a progression with step p equals 
^). So, let Ai, A 2 ,..., Ak be arithmetic progressions with steps Pi,P 2 , ■ ■ ■ ,Pk- We 

need to show that the density of their intersection equals ---. This, however 

is obvious, because due to the relative primeness, this intersection is an arithmetic 
progression with step P 1 P 2 ■ ■ - Pk- D 

Lemma 8.3. For any k > 1 we have 

na ~ it - 1 - vs:- 

j>k 
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Proof. Recall that dj = qj, 

Ifc 1 _ = IR ~ 

j>k j>k 


where qj denote the consecutive primes. Thus 


N N 

) = lim TT (1 —4) > lim TT (1-4) = 

j—k n—q k 


lim Qk - 1 • 1 = 1 - i = 1 - 

N—>oo q^. N Q k 


l 

yfa 


□ 
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